Attention heads – side by side

Model: · Layers: 24 · Heads: 14 · Tokens: 35. Cell values are the raw attention weights (after softmax).
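As a rough illustration of what "weights after softmax" implies for the cell values, the sketch below applies a standard row-wise softmax to a single row of scores. The function name `softmax` and the example scores are hypothetical and are not taken from the model shown here. It also assumes that each row of a head's grid corresponds to one query token.

```python
import math

def softmax(scores):
    """Turn one row of raw attention scores into weights that sum to 1."""
    peak = max(scores)  # subtract the row maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw scores for one query token attending over 4 key tokens.
raw_scores = [2.0, 1.0, 0.1, -1.0]
weights = softmax(raw_scores)

# Every weight lies between 0 and 1, and the row sums to 1
# (up to floating-point rounding).
print([round(w, 3) for w in weights])
print(round(sum(weights), 6))
```

Under that assumption, each row of a head's grid would hold 35 non-negative values that sum to 1, so a cell is best read as the share of that token's attention, not as an unbounded score.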
How to read the numbers